A NOTE ON THE Sopfr(n) FUNCTION. 



RUSLAN SHARIPOV 

Abstract. The Sopfr(n) function is defined as the sum of prime factors of n each of 
which is taken with its multiplicity. This function is studied numerically. The analogy 
between Sopfr(n) and the primes distribution function is drawn and some conjectures 
for prime numbers formulated in terms of the Sopfr(n) function are suggested. 



1. Introduction. 

The Sopfr(n) function is defined as the sum of prime factors of its positive 
integer argument n (see [1]). For n = 1 this function is defined to be equal to zero: 
Sopfr(1) = 0. If n is prime, then Sopfr(n) = n. If n is a product of prime numbers

$$n = p_1^{k_1} \cdots p_s^{k_s}, \tag{1.1}$$

then Sopfr(n) is calculated as the sum

$$\operatorname{Sopfr}(n) = k_1 p_1 + \ldots + k_s p_s. \tag{1.2}$$

Note that the prime factors $p_1, \ldots, p_s$ in the sum (1.2) are taken with their
multiplicities $k_1, \ldots, k_s$ in the expansion (1.1). Therefore the function Sopfr(n) is
similar to the logarithm. One can easily prove the following identity for it:

$$\operatorname{Sopfr}(n_1 \cdot n_2) = \operatorname{Sopfr}(n_1) + \operatorname{Sopfr}(n_2). \tag{1.3}$$
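As a minimal sketch (not part of the original note), the definition (1.2) and the identity (1.3) can be checked with plain trial division. The function name `sopfr` and the tested ranges are illustrative assumptions.

```python
def sopfr(n: int) -> int:
    """Sum of prime factors of n counted with multiplicity; sopfr(1) = 0."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    total = 0
    p = 2
    while p * p <= n:
        # Strip every copy of p, adding p once per copy (the multiplicity k_i).
        while n % p == 0:
            total += p
            n //= p
        p += 1
    if n > 1:
        # Whatever remains is one last prime factor.
        total += n
    return total


# Identity (1.3): Sopfr is completely additive, like a logarithm.
for n1 in range(1, 60):
    for n2 in range(1, 60):
        assert sopfr(n1 * n2) == sopfr(n1) + sopfr(n2)

print(sopfr(1), sopfr(7), sopfr(12))  # 0 7 7
```

Trial division is enough for single values; for whole ranges of n a sieve is the more economical choice.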

The Sopfr(n) function is used in defining Ruth-Aaron pairs, named after two
famous baseball players, George Herman Ruth Jr. and Henry Louis Aaron (see [2]).
In mathematics a Ruth-Aaron pair is a pair of consecutive numbers n and n + 1
whose sums of prime factors are equal to each other:

Sopfr(n) = Sopfr(n + 1). (1.4) 

The numbers 714 and 715 constitute the most famous Ruth-Aaron pair. 
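Indeed, 714 = 2 · 3 · 7 · 17 and 715 = 5 · 11 · 13, so both sums of prime factors equal 29. The sketch below lists the Ruth-Aaron pairs under a chosen bound using a smallest-prime-factor sieve; the helper names and the bound 1000 are assumptions made for illustration.

```python
import math


def sopfr_table(limit: int) -> list[int]:
    """Return a list t with t[m] = Sopfr(m) for 1 <= m <= limit (t[0] is unused)."""
    spf = list(range(limit + 1))  # spf[m] becomes the smallest prime factor of m
    for p in range(2, math.isqrt(limit) + 1):
        if spf[p] == p:  # p is prime
            for m in range(p * p, limit + 1, p):
                if spf[m] == m:
                    spf[m] = p
    table = [0] * (limit + 1)
    for m in range(2, limit + 1):
        # Additivity: Sopfr(m) = Sopfr(m / p) + p for the smallest prime p dividing m.
        table[m] = table[m // spf[m]] + spf[m]
    return table


def ruth_aaron_pairs(bound: int) -> list[int]:
    """Return every n < bound with Sopfr(n) = Sopfr(n + 1), as in (1.4)."""
    t = sopfr_table(bound)
    return [n for n in range(1, bound) if t[n] == t[n + 1]]


print(ruth_aaron_pairs(1000))  # each listed n starts a pair (n, n + 1)
```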

Let x be an integer number and let p, q, r, and s be four numbers expressed 
through x by the following four polynomials: 

$$\begin{aligned} p &= 8x + 5, & q &= 48x^2 + 24x - 1,\\ r &= 2x + 1, & s &= 48x^2 + 30x - 1. \end{aligned} \tag{1.5}$$

Using the formulas (1.5), one easily derives that

$$pq + 1 = 2^2\, r s, \qquad p + q = 2 + 2 + r + s. \tag{1.6}$$

Due to (1.6) and (1.3), if p, q, r, s are all prime numbers, then the numbers n = pq
and n + 1 = 4rs constitute a Ruth-Aaron pair, i.e. they satisfy the equality (1.4).
Schinzel's H-conjecture (see [3], [4], and [5]) implies that there are infinitely many 
integer numbers x such that the numbers p, q, r, and s given by the polynomials 
(1.5) all are prime. 
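The polynomial identities behind this construction can be checked directly. The following sketch assumes the four polynomials are p = 8x + 5, q = 48x² + 24x − 1, r = 2x + 1, s = 48x² + 30x − 1; the function names and the search range are illustrative, and a finite search of course says nothing about the infinitude expected under Schinzel's conjecture.

```python
def is_prime(m: int) -> bool:
    """Trial-division primality test; adequate for the small values used here."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True


def family(x: int) -> tuple[int, int, int, int]:
    """Values (p, q, r, s) of the four polynomials at x."""
    return (8 * x + 5, 48 * x * x + 24 * x - 1, 2 * x + 1, 48 * x * x + 30 * x - 1)


def prime_quadruples(x_max: int) -> list[int]:
    """All x in [0, x_max] for which p, q, r, s are simultaneously prime."""
    return [x for x in range(x_max + 1) if all(is_prime(v) for v in family(x))]


# The two identities hold for every integer x, whether or not the values are prime.
for x in range(-50, 51):
    p, q, r, s = family(x)
    assert p * q + 1 == 4 * r * s
    assert p + q == 2 + 2 + r + s

for x in prime_quadruples(200):
    p, q, r, s = family(x)
    print(x, p * q, 4 * r * s)  # x, n, n + 1
```

For x = 3 one gets p = 29, q = 503, r = 7, s = 521, hence the pair n = 14587, n + 1 = 14588, with the sum of prime factors equal to 532 on both sides.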

In this paper we treat Sopfr(n) as an analog of the primes distribution function
$\pi(n)$. The value $\pi(n)$ of this function is defined as the number of positive primes
less than or equal to n. Gauss and Legendre (see [6]) in 1792-1808 conjectured the
following asymptotic behavior of the function $\pi(n)$:

$$\pi(n) \sim \frac{n}{\ln(n)} \quad \text{as} \quad n \to \infty. \tag{1.7}$$

In 1849 and in 1852 P. L. Chebyshev proved two propositions very close to (1.7).
The proposition (1.7) itself was proved in 1896 by Hadamard [7] and de la Vallée Poussin
[8]. See [9] for a modern exposition of their proof.
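For orientation, the ratio of the two sides of (1.7) can be tabulated with a simple sieve of Eratosthenes. This is an illustrative sketch only; the function name and the sample values of n are assumptions.

```python
import math


def prime_counts(limit: int) -> list[int]:
    """Return a list c with c[n] = pi(n) for 0 <= n <= limit."""
    is_prime = [True] * (limit + 1)
    is_prime[0:2] = [False] * min(2, limit + 1)  # 0 and 1 are not prime
    for p in range(2, math.isqrt(limit) + 1):
        if is_prime[p]:
            for m in range(p * p, limit + 1, p):
                is_prime[m] = False
    counts, running = [], 0
    for flag in is_prime:
        running += flag
        counts.append(running)
    return counts


counts = prime_counts(10**5)
for n in (10**2, 10**3, 10**4, 10**5):
    # By (1.7) the last column tends to 1, although slowly.
    print(n, counts[n], counts[n] * math.log(n) / n)
```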

The main goal of this paper is to study the function Sopfr(n) numerically and 
formulate some conjectures similar to (1.7) for this function. 

2. The averaged Sopfr(n) function. 

The Sopfr(n) function is quite irregular. Looking at its graph (see [1]), one can 
find that the values of Sopfr(n) resemble random numbers. In order to make them 
more regular we average them over intervals between two consecutive squares: 

$$A(n) = \frac{\sum_{i=n^2}^{(n+1)^2-1} \operatorname{Sopfr}(i)}{(n+1)^2 - n^2}. \tag{2.1}$$

The function A(n) in (2.1) is the averaged Sopfr(n) function. We study its values
in two intervals $1 \le n \le 998$ and $1000 \le n \le 3161$. The graph of the function (2.1)
in the first interval $1 \le n \le 998$ is shown in Fig. 2.1. It is presented by a sequence
of points whose coordinates are rendered in logarithmic scale, i.e. $A_n = (x_n, y_n)$,
where $x_n = \ln(A(n))$ and $y_n = \ln(n)$.
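A minimal sketch of the averaging step follows. It assumes that the interval between two consecutive squares is taken as n² ≤ i ≤ (n + 1)² − 1, which contains exactly (n + 1)² − n² = 2n + 1 integers; the choice of endpoints and the helper names are assumptions, not taken from the text.

```python
import math


def sopfr_table(limit: int) -> list[int]:
    """Return a list t with t[m] = Sopfr(m) for 1 <= m <= limit (t[0] is unused)."""
    spf = list(range(limit + 1))  # spf[m] becomes the smallest prime factor of m
    for p in range(2, math.isqrt(limit) + 1):
        if spf[p] == p:
            for m in range(p * p, limit + 1, p):
                if spf[m] == m:
                    spf[m] = p
    table = [0] * (limit + 1)
    for m in range(2, limit + 1):
        table[m] = table[m // spf[m]] + spf[m]
    return table


def averaged_sopfr(n: int, table: list[int]) -> float:
    """A(n): mean of Sopfr(i) over n^2 <= i <= (n+1)^2 - 1 (assumed endpoints)."""
    lo, hi = n * n, (n + 1) * (n + 1)
    return sum(table[lo:hi]) / (hi - lo)


n_max = 100
table = sopfr_table((n_max + 1) ** 2)
for n in (1, 2, 10, 100):
    a = averaged_sopfr(n, table)
    print(n, a, math.log(a), math.log(n))  # n, A(n), x_n, y_n
```

With these endpoints A(1) = (0 + 2 + 3)/3 and A(2) = (4 + 5 + 5 + 7 + 6)/5.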

Looking at Fig. 2.1 below, one can see that the points $A_n$ with $n \ge 122 \approx e^{4.8}$
are approximated by a straight line. We write the equation of this straight line as

$$x = \alpha\, y + \beta. \tag{2.2}$$

In order to calculate the parameters $\alpha$ and $\beta$ in (2.2) we use the least squares
method. For this purpose we use the following quadratic deviation function:

$$F(\alpha,\beta) = \sum_{n=122}^{998} (x_n - \alpha\, y_n - \beta)^2. \tag{2.3}$$

The quadratic function (2.3) has exactly one minimum point $\alpha = \alpha_{\min}$, $\beta = \beta_{\min}$.



This minimum point is determined by the following linear equations: 



$$\frac{\partial F(\alpha,\beta)}{\partial \alpha} = 0, \qquad \frac{\partial F(\alpha,\beta)}{\partial \beta} = 0. \tag{2.4}$$



Fig. 2.1 (axes: $\ln(A(n))$ and $\ln(n)$)

The function (2.3) and the equations (2.4) are computed numerically. Solving them,
we find the numeric values of $\alpha$ and $\beta$ at the minimum of the function $F(\alpha,\beta)$:

$$\alpha \approx 1.820, \qquad \beta \approx -0.847. \tag{2.5}$$

Having calculated the constants (2.5), now we draw the graph of the deviation
function $\delta(n) = \ln(A(n)) - \alpha \ln(n) - \beta$ in logarithmic scale. The graph of the
function $\delta(n)$ in Fig. 2.2 is presented by a series of points $A_n = (x_n, y_n)$, where
$x_n = \ln(n)$ and $y_n = \delta(n)$. Looking at this graph, we derive the following inequality
for the deviation function $\delta(n)$:

$$-\delta_1 < \delta(n) < \delta_1, \quad \text{where} \quad \delta_1 = 0.15 \quad \text{and} \quad 122 \le n \le 998. \tag{2.6}$$
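The straight-line fit and the deviation bound can be sketched as follows. Setting both partial derivatives of the quadratic deviation function to zero gives two linear equations in α and β, solved here in closed form. The code is not the author's program: it assumes the averaging interval n² ≤ i < (n + 1)² in (2.1), all function names are illustrative, and the printed numbers should only be expected to land near (2.5) and (2.6) if that assumption matches the original computation.

```python
import math


def sopfr_table(limit: int) -> list[int]:
    """Return a list t with t[m] = Sopfr(m) for 1 <= m <= limit (t[0] is unused)."""
    spf = list(range(limit + 1))  # spf[m] becomes the smallest prime factor of m
    for p in range(2, math.isqrt(limit) + 1):
        if spf[p] == p:
            for m in range(p * p, limit + 1, p):
                if spf[m] == m:
                    spf[m] = p
    table = [0] * (limit + 1)
    for m in range(2, limit + 1):
        table[m] = table[m // spf[m]] + spf[m]
    return table


def log_points(n_min: int, n_max: int) -> tuple[list[float], list[float]]:
    """Points (x_n, y_n) = (ln A(n), ln n) for n_min <= n <= n_max."""
    table = sopfr_table((n_max + 1) ** 2)
    xs, ys = [], []
    for n in range(n_min, n_max + 1):
        lo, hi = n * n, (n + 1) * (n + 1)  # assumed averaging interval
        xs.append(math.log(sum(table[lo:hi]) / (hi - lo)))
        ys.append(math.log(n))
    return xs, ys


def fit_line(xs: list[float], ys: list[float]) -> tuple[float, float]:
    """Minimize sum (x - alpha*y - beta)^2 via the two normal equations."""
    count = len(xs)
    sx, sy = sum(xs), sum(ys)
    syy = sum(y * y for y in ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    alpha = (count * sxy - sx * sy) / (count * syy - sy * sy)
    beta = (sx - alpha * sy) / count
    return alpha, beta


def max_deviation(xs: list[float], ys: list[float], alpha: float, beta: float) -> float:
    """Largest |delta(n)| with delta(n) = x_n - alpha*y_n - beta."""
    return max(abs(x - alpha * y - beta) for x, y in zip(xs, ys))


if __name__ == "__main__":
    xs, ys = log_points(122, 998)
    alpha, beta = fit_line(xs, ys)
    print(alpha, beta, max_deviation(xs, ys, alpha, beta))
```

The same three functions apply unchanged to the second interval by calling `log_points(1000, 3161)`, at the cost of sieving up to about 10^7, which is slow in pure Python.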

The next interval is $1000 \le n \le 3161$. The graph of the function (2.1) in
this interval is shown in Fig. 2.3. Again it is presented by a sequence of points
whose coordinates are rendered in logarithmic scale, i.e. $A_n = (x_n, y_n)$, where
$x_n = \ln(A(n))$ and $y_n = \ln(n)$. The graph in Fig. 2.3 is also approximated by a
straight line.

Fig. 2.3

This straight line is given by the equation (2.2). The coefficients $\alpha$
and $\beta$ in this case are calculated by solving the equations (2.4) for the following
quadratic deviation function, which is similar to (2.3):

$$F(\alpha,\beta) = \sum_{n=1000}^{3161} (x_n - \alpha\, y_n - \beta)^2. \tag{2.7}$$

The minimum of the function (2.7) corresponds to the following values of $\alpha$ and $\beta$:

$$\alpha \approx 1.860, \qquad \beta \approx -1.115. \tag{2.8}$$

The sharpness of the approximation of A(n) by the straight line in Fig. 2.3 is
expressed through the deviation function $\delta(n) = \ln(A(n)) - \alpha \ln(n) - \beta$, where $\alpha$
and $\beta$ are given by the formulas (2.8):

$$-\delta_2 < \delta(n) < \delta_2, \quad \text{where} \quad \delta_2 = 0.1 \quad \text{and} \quad 1000 \le n \le 3161. \tag{2.9}$$



The inequalities (2.9) are similar to the above inequalities (2.6). They are derived
by drawing the graph of the function $\delta(n)$. This graph is shown in Fig. 2.4 below.

3. Approximation conjectures.

Note that the parameter $\alpha$ in (2.8) is greater than $\alpha$ in (2.5). This means that the
slope of the straight line approximating the graph of the function A(n) slightly grows
as $n \to \infty$. To take into account this growth we replace the linear approximation
in (2.2) by a nonlinear one. We choose the following formula for it:

$$x = \alpha\, y + \beta + \gamma \ln(y) + \lambda\, e^{-y} + \mu\, e^{-2y}. \tag{3.1}$$

The choice of (3.1) means that A(n) is approximated by the formula

$$A(n) \approx B\, n^{\alpha} (\ln n)^{\gamma} \exp\!\Bigl(\frac{\lambda}{n} + \frac{\mu}{n^2}\Bigr), \quad \text{where} \quad B = e^{\beta}. \tag{3.2}$$
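The step from (3.1) to (3.2) is a direct exponentiation, spelled out here for convenience. Since $x = \ln A(n)$ and $y = \ln n$, one has $e^{-y} = 1/n$ and $e^{-2y} = 1/n^2$, so for a point lying exactly on the curve (3.1)

$$A(n) = e^{x} = e^{\alpha \ln n}\, e^{\beta}\, e^{\gamma \ln(\ln n)}\, e^{\lambda/n + \mu/n^2} = e^{\beta}\, n^{\alpha} (\ln n)^{\gamma} \exp\!\Bigl(\frac{\lambda}{n} + \frac{\mu}{n^2}\Bigr).$$

For the actual data points the relation holds only approximately, which is why the approximation sign appears in (3.2).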

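In order to find the optimal values of the parameters $\alpha$, $\beta$, $\gamma$, $\lambda$, and $\mu$ for the
approximation (3.2) we apply the least squares method. Instead of (2.3) and
(2.7) in this case we use the following deviation function:

$$F = \sum_{n=4}^{3161} \bigl(x_n - \alpha\, y_n - \beta - \gamma \ln y_n - \lambda\, e^{-y_n} - \mu\, e^{-2y_n}\bigr)^2. \tag{3.3}$$

Remember that $x_n = \ln(A(n))$ and $y_n = \ln(n)$ in (3.3). The optimal values of $\alpha$, $\beta$,
$\gamma$, $\lambda$, and $\mu$ are determined by solving the equations

$$\frac{\partial F}{\partial \alpha} = 0, \quad \frac{\partial F}{\partial \beta} = 0, \quad \frac{\partial F}{\partial \gamma} = 0, \quad \frac{\partial F}{\partial \lambda} = 0, \quad \frac{\partial F}{\partial \mu} = 0. \tag{3.4}$$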

The equations (3.4) are similar to the equations (2.4). Here is their solution: 

$$\alpha \approx 2.001, \quad \beta \approx -0.047, \quad \gamma \approx -1.056, \quad \lambda \approx 1.187, \quad \mu \approx -2.240. \tag{3.5}$$
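Because the model is linear in its five parameters, the minimization conditions form a 5 × 5 linear system. The sketch below builds and solves that system by Gaussian elimination with partial pivoting. It is not the author's program: the function names are hypothetical, and the demonstration uses synthetic points generated from made-up parameter values rather than the actual data $(x_n, y_n)$.

```python
import math


def basis(y: float) -> list[float]:
    """Basis functions at y = ln(n): y, 1, ln(y), e^{-y}, e^{-2y}."""
    return [y, 1.0, math.log(y), math.exp(-y), math.exp(-2.0 * y)]


def solve_linear(a: list[list[float]], b: list[float]) -> list[float]:
    """Solve a * c = b by Gaussian elimination with partial pivoting."""
    k = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix (a copy)
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, k):
            factor = m[r][col] / m[col][col]
            for j in range(col, k + 1):
                m[r][j] -= factor * m[col][j]
    c = [0.0] * k
    for r in range(k - 1, -1, -1):  # back substitution
        tail = sum(m[r][j] * c[j] for j in range(r + 1, k))
        c[r] = (m[r][k] - tail) / m[r][r]
    return c


def fit_model(xs: list[float], ys: list[float]) -> tuple[float, ...]:
    """Return (alpha, beta, gamma, lam, mu) minimizing the sum of squared deviations."""
    rows = [basis(y) for y in ys]
    k = 5
    gram = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    rhs = [sum(r[i] * x for r, x in zip(rows, xs)) for i in range(k)]
    return tuple(solve_linear(gram, rhs))


def deviation_sum(params: tuple[float, ...], xs: list[float], ys: list[float]) -> float:
    """Value of the quadratic deviation function for the given parameters."""
    return sum(
        (x - sum(c * f for c, f in zip(params, basis(y)))) ** 2
        for x, y in zip(xs, ys)
    )


# Synthetic demonstration: made-up parameters, NOT the values reported in the text.
made_up = (2.0, -0.05, -1.0, 1.2, -2.2)
ys = [math.log(n) for n in range(4, 401)]
xs = [sum(c * f for c, f in zip(made_up, basis(y))) for y in ys]
fitted = fit_model(xs, ys)
print(fitted, deviation_sum(fitted, xs, ys))
```

The basis functions are strongly correlated over the fitted range, so the system is ill-conditioned; for real data a QR-based or SVD-based least squares solver would be the more cautious choice.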

The exponential factor with $\lambda$ and $\mu$ in (3.2) is a decreasing function of n. For
this reason we consider the function

$$B(n) = \frac{A(n)}{n^{\alpha} (\ln n)^{\gamma}}$$

and draw its graph. Like the graph of A(n), it is presented as a sequence of points.

Fig. 3.1

Looking at the graph in Fig. 3.1, we can formulate the following conjecture. 

Conjecture 3.1 (weak Sopfr(n) conjecture). There are four constants $\alpha$, $\gamma$, $B_1$,
and $B_2$ such that the averaged Sopfr function A(n) in (2.1) obeys the inequalities

$$B_1\, n^{\alpha} (\ln n)^{\gamma} < A(n) < B_2\, n^{\alpha} (\ln n)^{\gamma} \quad \text{for all} \quad n > 1. \tag{3.6}$$
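One way to probe this conjecture numerically is to compute the ratio $A(n) / (n^{\alpha} (\ln n)^{\gamma})$ over a finite range and record its extremes, which then serve as empirical candidates for $B_1$ and $B_2$ on that range only. The sketch assumes the averaging interval n² ≤ i < (n + 1)² in (2.1) and takes α and γ from (3.5) by default; the function names and the range are illustrative.

```python
import math


def sopfr_table(limit: int) -> list[int]:
    """Return a list t with t[m] = Sopfr(m) for 1 <= m <= limit (t[0] is unused)."""
    spf = list(range(limit + 1))  # spf[m] becomes the smallest prime factor of m
    for p in range(2, math.isqrt(limit) + 1):
        if spf[p] == p:
            for m in range(p * p, limit + 1, p):
                if spf[m] == m:
                    spf[m] = p
    table = [0] * (limit + 1)
    for m in range(2, limit + 1):
        table[m] = table[m // spf[m]] + spf[m]
    return table


def ratio(n: int, table: list[int], alpha: float, gamma: float) -> float:
    """A(n) / (n^alpha * (ln n)^gamma) for n >= 2."""
    lo, hi = n * n, (n + 1) * (n + 1)  # assumed averaging interval
    a = sum(table[lo:hi]) / (hi - lo)
    return a / (n ** alpha * math.log(n) ** gamma)


def band(n_min: int, n_max: int, alpha: float = 2.001, gamma: float = -1.056) -> tuple[float, float]:
    """(min, max) of the ratio over n_min <= n <= n_max, with n_min >= 2."""
    table = sopfr_table((n_max + 1) ** 2)
    values = [ratio(n, table, alpha, gamma) for n in range(n_min, n_max + 1)]
    return min(values), max(values)


print(band(2, 300))
```

A finite scan of this kind can suggest values for the two constants but cannot establish the inequalities for all n.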



The graph points in Fig. 3.1 condense to a band as $n \to \infty$. Its width is restricted
by the constants $B_1$ and $B_2$ in (3.6). The width of this band can vanish at infinity.
For this option we can formulate the following conjecture.

Conjecture 3.2 (strong Sopfr(n) conjecture). There are three constants $\alpha$, $\gamma$,
and $B$ such that $B > 0$ and the following condition is fulfilled:

$$A(n) \sim B\, n^{\alpha} (\ln n)^{\gamma} \quad \text{as} \quad n \to \infty. \tag{3.7}$$

Note that the constants $\alpha$ and $\gamma$ in (3.5) are very close to integer numbers.
Therefore we can formulate another conjecture.

Conjecture 3.3. The constants $\alpha$ and $\gamma$ either in (3.6) or in (3.7) are explicit
numbers $\alpha = 2$ and $\gamma = -1$.

The averaged Sopfr(n) function (2.1) is similar to the primes distribution function.
The above formulas (3.6) and (3.7) are similar to the formula (1.7).